Three minimal sequences found in Ebola virus genomes and absent from human DNA

نویسندگان

  • Raquel M. Silva
  • Diogo Pratas
  • Luísa Castro
  • Armando J. Pinho
  • Paulo Jorge S. G. Ferreira
چکیده

MOTIVATION Ebola virus causes high mortality hemorrhagic fevers, with more than 25 000 cases and 10 000 deaths in the current outbreak. Only experimental therapies are available, thus, novel diagnosis tools and druggable targets are needed. RESULTS Analysis of Ebola virus genomes from the current outbreak reveals the presence of short DNA sequences that appear nowhere in the human genome. We identify the shortest such sequences with lengths between 12 and 14. Only three absent sequences of length 12 exist and they consistently appear at the same location on two of the Ebola virus proteins, in all Ebola virus genomes, but nowhere in the human genome. The alignment-free method used is able to identify pathogen-specific signatures for quick and precise action against infectious agents, of which the current Ebola virus outbreak provides a compelling example.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluation of First and Second Markov Chains Sensitivity and Specificity as Statistical Approach for Prediction of Sequences of Genes in Virus Double Strand DNA Genomes

Growing amount of information on biological sequences has made application of statistical approaches necessary for modeling and estimation of their functions. In this paper, sensitivity and specificity of the first and second Markov chains for prediction of genes was evaluated using the complete double stranded  DNA virus. There were two approaches for prediction of each Markov Model parameter,...

متن کامل

Minimal absent words in a sliding window & applications to on-line pattern matching

An absent (or forbidden) word of a word y is a word that does not occur in y. It is then called minimal if all its proper factors occur in y. There exist linear-time and linear-space algorithms for computing all minimal absent words of y (Crochemore et al., 1998, Belazzougui et al., 2013, Barton et al., 2014). Minimal absent words are used for data compression (Crochemore et al., 2000, Ota and ...

متن کامل

Profile of Eight Prophage Sequences Present in the Genomes of Different Acinetobacter baumannii Strains

ABSTRACT           Background and Objective: Prophage sequences are major contributors to interstrain variations within the same bacterial species. Acinetobacter baumannii is a gram-negative bacterium that causes a wide range of nosocomial infections, especially in intensive care unit inpatients. Prophage sequences constitute a considerable proporti...

متن کامل

The relationship between Human Papillomavirus and Epstein-Barr virus infections with breast cancer of Iranian patients

Background: Breast cancer is the malignancy in humans and other mammals. Several risk factors are involved in their appearance such as higher hormone levels and obesity. Identification of a mouse mammary tumor virus supports a viral etiology for breast tumors in animals. Viruses have been implicated in the development of various cancers, but viral induction for formation breast cancer is contro...

متن کامل

A Short Overview of Ebola Outbreak

  Ebola virus disease (formerly known as Ebola haemorrhagic fever) is a severe, often fatal illness, with a death rate of up to 90%. The illness affects humans and nonhuman primates (monkeys, gorillas, and chimpanzees). Ebola first appeared in 1976 in two simultaneous outbreaks, one in a village near the Ebola River in the Democratic Republic of Congo, and the other in a remote area of Sudan. T...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 31  شماره 

صفحات  -

تاریخ انتشار 2015